Landscape of Standing Variation for Tandem Duplications in Drosophila yakuba and Drosophila simulans
نویسندگان
چکیده
We have used whole genome paired-end Illumina sequence data to identify tandem duplications in 20 isofemale lines of Drosophila yakuba and 20 isofemale lines of D. simulans and performed genome wide validation with PacBio long molecule sequencing. We identify 1,415 tandem duplications that are segregating in D. yakuba as well as 975 duplications in D. simulans, indicating greater variation in D. yakuba. Additionally, we observe high rates of secondary deletions at duplicated sites, with 8% of duplicated sites in D. simulans and 17% of sites in D. yakuba modified with deletions. These secondary deletions are consistent with the action of the large loop mismatch repair system acting to remove polymorphic tandem duplication, resulting in rapid dynamics of gain and loss in duplicated alleles and a richer substrate of genetic novelty than has been previously reported. Most duplications are present in only single strains, suggesting that deleterious impacts are common. Drosophila simulans shows larger numbers of whole gene duplications in comparison to larger proportions of gene fragments in D. yakuba. Drosophila simulans displays an excess of high-frequency variants on the X chromosome, consistent with adaptive evolution through duplications on the D. simulans X or demographic forces driving duplicates to high frequency. We identify 78 chimeric genes in D. yakuba and 38 chimeric genes in D. simulans, as well as 143 cases of recruited noncoding sequence in D. yakuba and 96 in D. simulans, in agreement with rates of chimeric gene origination in D. melanogaster. Together, these results suggest that tandem duplications often result in complex variation beyond whole gene duplications that offers a rich substrate of standing variation that is likely to contribute both to detrimental phenotypes and disease, as well as to adaptive evolutionary change.
منابع مشابه
Tandem Duplications and the Limits of Natural Selection in Drosophila yakuba and Drosophila simulans
Tandem duplications are an essential source of genetic novelty, and their variation in natural populations is expected to influence adaptive walks. Here, we describe evolutionary impacts of recently-derived, segregating tandem duplications in Drosophila yakuba and Drosophila simulans. We observe an excess of duplicated genes involved in defense against pathogens, insecticide resistance, chorion...
متن کاملProposal for the Sequencing of Drosophila yakuba and D. simulans
Overview Comparative genome sequencing has the greatest impact on biology when the targeted genomes impinge directly on analysis or interpretation of the human genome or the genome of a genetic model system. Comparative genomics may also shed light on the genetic and evolutionary mechanisms that determine genome organization and composition. The most obvious benefit of comparative genomics has ...
متن کاملProposal for Sequencing of the Drosophila yakuba and D. simulans Genomes
Overview Comparative genome sequencing has the greatest impact on biology when the targeted genomes impinge directly on analysis or interpretation of the human genome or the genome of a genetic model system. Comparative genomics may also shed light on the genetic and evolutionary mechanisms that determine genome organization and composition. The most obvious benefit of comparative genomics has ...
متن کاملTandem duplications lead to novel expression patterns through exon shuffling in Drosophila yakuba
One common hypothesis to explain the impacts of tandem duplications is that whole gene duplications commonly produce additive changes in gene expression due to copy number changes. Here, we use genome wide RNA-seq data from a population sample of Drosophila yakuba to test this 'gene dosage' hypothesis. We observe little evidence of expression changes in response to whole transcript duplication ...
متن کاملPopulation Genomics: Whole-Genome Analysis of Polymorphism and Divergence in Drosophila simulans
The population genetic perspective is that the processes shaping genomic variation can be revealed only through simultaneous investigation of sequence polymorphism and divergence within and between closely related species. Here we present a population genetic analysis of Drosophila simulans based on whole-genome shotgun sequencing of multiple inbred lines and comparison of the resulting data to...
متن کامل